Quantum Process Tomography via ^i-norm Minimization 
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For an initially well designed but imperfect quantum information system, the process matrix is almost sparse 
in an appropriate basis. Existing theory and associated computational methods (£i-norm minimization) for 
reconstructing sparse signals establish conditions under which the sparse signal can be perfectly reconstructed 
from a very limited number of measurements (resources). Although a direct extension to quantum process 
tomography of the i?i-norm minimization theory has not yet emerged, the numerical examples presented here, 
which apply £i-norm minimization to quantum process tomography, show a significant reduction in resources 
to achieve a desired estimation accuracy over existing methods. 



Quantum process tomography (QPT) refers to the use of 
measured data to estimate the dynamics of a quantum system 
ill, |2i] . Unfortunately, in the general case, the dimension of 
the parameter space for QPT can be prohibitive, scaling ex- 
ponentially with the number of qubits. This in turn places the 
same burden on resources, e.g., the number of applied inputs, 
measurement outcomes, and experiments to achieve a desired 
accuracy, as well as estimation computational complexity. A 
number of approaches have been developed to alleviate this 
burden. Of note are the various forms of ancilla assisted 
QPT (see [3] for a review), and the use of symmetrisation 
to estimate selected process properties Here we present 
a method which can be used either alone or in conjunction 
with any of the aforementioned approaches. The underlying 
premise is that for an intially well engineered design, the ob- 
ject that describes the quantum dynamics, the process matrix, 
will be almost sparse in the appropriate basis. Certainly in the 
ideal case of a perfect unitary channel, in the corresponding 
ideal basis, the process matrix is maximally sparse, i.e., it has 
a single non-zero element. Since environmental interactions 
cannot be totally eliminated, the actual process matrix in this 
ideal basis will be populated with many small elements, and 
thus, is almost sparse. 

These are the conditions under which methods using ti- 
norm minmization - often referred to as Compressive Sens- 
ing - are applicable IH Specifically, for a class of 
incomplete linear measurement equations {y = Ax, A E 
p^mxn ^ ^ ^-j^ constrained ^i-norm minimization (mini- 
mize subject to y = Ax), a convex optimization prob- 
lem, can perfectly estimate the sparse variable x. These meth- 
ods also work very well for systems which do not satisfy the 
theoretical conditions, i.e., for almost sparse variables and 
with measurement noise. 

The underlying theory of £i minimization shows that under 
certain conditions on the matrix A, to realize perfect recovery, 
the number of measurements, m, scales with the product of 
the log of the number of variables n and the sparsity. Since 
QPT parameters are linear in probability outcomes, and scale 
exponentially with the number of qubits, this approach heralds 
a possible linear scaling with qubits. The theory, however, has 
not as yet been extended to QPT. The numerical examples here 
are not meant to lend support to this scaling as they are only 
presented for the two-qubit case. The examples do, however, 
show more than an order of magnitude savings in resources 
over a standard constrained least-squares estimation using a 



complete set of measurements, i.e., rank(yl) > n. 

The paper is organized as follows: QPT formalism is de- 
scribed next, followed by a discussion of the genesis of pro- 
cess matrix (almost) sparsity. A form of the li minimization 
for QPT is then presented followed by numerical examples 
and some concluding remarks. 

QPT Formalism. — Recall that the state-to-state dynamics 
of an open finite-dimensional quantum system can be de- 
scribed in the following canonical form 1 1]: 
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where p, p E C"^" are the input and output state, respec- 
tively, of dimension n, Xa/i are the elements of the x 
process matrix X, and the matrices Fq form an orthonormal 
basis set for n x n complex matrices: 



{r„ e C"><" |Tr rt = s^p, a,/? = i, 



} (2) 



It is assumed that the quantum system to be estimated is com- 
pletely positive and trace preserving (CPTP). The set of feasi- 
ble process matrices is then restricted to the convex set L8j,i^, 



(3) 



X > (positive semidefinite) 
It follows from Q that the number of real parameters in the 



process matrix is n 



. For q qubits n — 2'^, hence, scaling 



with parameters is exponential in the number of qubits. 

Collecting data. — A common method for collecting data 
from a quantum system is via repeated identical experiments. 
Denote hy i = 1, . . . , riout the distinct outcomes, and by 
k — 1; . . . , ricfg the experimental configurations, e.g., any 
"knobs" associated with state inputs and/or measurement de- 
vices. The measurement outcomes are recorded from iden- 
tical experiments in each configuration k repeated Nk times. 
Let Nik denote the number of times out of Nk that outcome 
i occurred in configuration k. The QPT data are the recorded 
outcome counts. 



ncfg} (4) 
ik is the total num- 



where TV = ES^^^ = ESE.=i 
ber of experiments. 

Estimating the process matrix. — An empirical estimate of 

the probability of measuring outcome i in configuration k can 

be obtained from (|4|i as. 
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From the Born Rule the model probability of outcome i given 
configuration k with observable Mi^ is, — Tr Mn^p/^, 

where from O, pk = Ea,/3=i Xai3TaPk'^l- In terms of the 
process matrix X, the Born rule then becomes, 

p,k{X) = TrG.kX 

{G,k)a.0 = TrrlM^kT^Pk ^' 

The riout'T-ctg matrices Gifc e C"^" capture the effect of mea- 
surements in the matrix basis set (|2]i. For each outcome i, the 
complete set of configurations is the combination of all these 
matrices and the input states: {pk, Gjfc}^'!!^. 

A process matrix estimate can be obtained by minimiz- 
ing the difference between the empirical probability estimates 
p™^ and the model probabilities pik{X) subject to the feasi- 
bility constraint (O. Using a "least-squares" measure of prob- 
ability error leads to estimating the process matrix by solving 
the optimization problem: 

minimize Fls(X) = T,^,k ipT^ " Pik{X)f 
subject to X satisfies ([3]) 

Because the outcomes of each experiment are independent, a 
maximum likelihood approach can also be considered, i.e., 

minimize Vmi.{X) = -Y.i,k^it^'^'^&Pik{X) 
subject to X satisfies ^ 

Both O and ([8]l are convex optimization problems with the 
optimization variables being the elements of X IHI^. The 
resulting solution (estimate) will always be CPTP (O. Unfor- 
tunately, as already mentioned, the dimension of the parame- 
ter space (n^ — -n? ,n = 2'') can severely strain resources to 
the point of impracticality. To see this more clearly, let the 
linear relation in (|6]l between the rioutf^cfg model probability 
outcomes and the elements of the process matrix be repre- 
sented by an riout"-cfg x matrix Q, i.e., 

P^gX (9) 

where p, X are vectors formed from the pik and elements of 
X, respectively. Accounting for the -n? linear constraints in 
(|3]l, X can be recovered from either (|7]i or dHJ to within any 
desired accuracy by using enough data {N in (HJl sufficiently 
large), provided that rank(^) > riout"-cfg > — n?. There- 
fore it would seem that the resources, nout'T-cfg, must also 
scale exponentially with the number of qubits. This, however, 
is not the case when the process matrix is almost sparse and 
where the sparsity pattern is not known ITtI . 

Almost sparsity of the process matrix. — With no noise 
the ideal channel p ^ p for a quantum information sys- 
tem is a unitary, i.e., p = U pU'^ . Let {f „ e C""="}f^i 
denote the "Natural-Basis" for matrices in C"^", i.e., each 
basis matrix has a single non-zero element of one. In this 
basis, the process matrix associated with the ideal unitary 
channel has the rank-1 form, Xjdcai — xx^ with x £ 
C" , x'^x = n. A singular value decomposition (SVD) gives 
^idoai = Vdmg{n, 0, . . . , 0)^^ with V £ C"^" a unitary. 
An equivalent process matrix can be formed from the SVD 



|X| in Natural-Basis |X| in Ideal/SVD-Basis 
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FIG. 1: Absolute values of the elements of the process matrix X £ 
C^B^'^B for: (a) ideal in the Natural-Basis; (b) ideal in Ideal/SVD- 
Basis; (c) actual (pm = 0.05) in Natural-Basis, (d) actual (pbf = 
0.05) in Ideal/SVD-Basis;(e) actual (pbf = 0.2) in Natural-Basis, (f) 
actual (pbf = 0.2) in Ideal/SVD-Basis. 

in what is referred to here as the "Ideal/SVD-Basis," {Fq = 

2 _ 2 

J2a'=i^'^'a^o^' ^ C"^"}JJ^j, The equivalent process ma- 
trix, in this basis, denoted by Xi^ced, is maximally sparse with 
a single non-zero element, specifically, (Xidcai)ii — n. As 
will always be the case, the actual channel will be a perturba- 
tion of the ideal unitary. If the noise source is small then the 
process matrix in the nominal basis will be almost sparse. 

Example: Noisy two-qubit memory. — Consider a system 
which is ideally a two-qubit quantum memory, thus U = 
l4,n = 4. Suppose the actual system is a perturbation of 
identity by independent bit-flip errors in each channel occur- 
ring with probability pbf- Forpbt ~ 0.05 andpbf ~ 0.2, the 
respective channel fidelities are about 0.90 and 0.64, which for 
quantum information processing would need to be discovered 
by QPT and then corrected for the device to ever work. Refer- 
ring to FiglH in the Natural-Basis, FigUJa), the ideal 16 x 16 
process matrix has 16 non-zero elements out of 256, all of 
magnitude one. Using the Ideal/SVD-Basis the correspond- 
ing process matrix as shown in Figlllb) has a single non-zero 
element of magnitude n = 4 - it is clearly maximally sparse. 
Figinc)-(d) and (e)-(f), respectively, show the effect of the 
two Pbf levels in the two basis sets. In the Ideal/SVD-basis 
FiglTJd) and (f) show that the actual (noisy) process matrices 
are almost sparse. 

Sparsity minimization. — A known heuristic for minimizing 
sparsity without knowing the sparsity pattern, and also accru- 
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ing the benefit of using fewer resources, is to minimize the £i- 
norm of the vector of variables li5fc^^. For QPT the equiva- 
lent £i norm is defined here as the sum of the absolute values 
of the real and imaginary parts of each element of the process 
matrix. There are many related approaches to incorporate this 
norm. For example, an estimate of X can be obtained by solv- 
ing the following convex optimization problem: ifisl 

2 

minimize = J2l,0=ii\^^ ^apl + {ImXapl) 

subject to V{X) < a,X satisfies (O 

(10) 

with, e.g., V{X) from (|7]i or (O. The optimization parameter 
a is used to regulate the tradeoff between fitting X to the data 
by minimizing V{X)vs. minimizing the sparsity of X via the 
£i-norm. Selecting a is often done by averaging V{X) over a 
series of surrogates for X obtained from anticipated scenarios 
or iterating estimation and experiment design, e.g., [8]. 

In the examples to follow we use the modification of ( fTOl i 
suggested in LTJ, referred to there as "^i-reweighted mini- 
mization." In this approach a weighted £i-norm is used with 
the weights determined iteratively. The algorithm described 
in H is: 

Initialize a > 0, e > 0, W = I^^ 
Repeat 

1. Solve for X 



minimize ||T4^X||^^ 

subject to V{X) < <7,X satisfies Q 

2. Update weights 

W = diag(l/(|xi| + e),...,l/(|x„4|+e)) 
X ^ X 



(11) 



(12) 



Until convergence - the objective stops decreasing or a max- 
imum number of iterations is reached. 

In each of the examples to follow the procedure for QPT is: 
(i) solve (|7]l to obtain Xe^; (ii) set cr — 1.3 VlXg^)', (iii) solve 
the reweighting algorithm (fTTI)-(fT2b for Xi-^ . 

Example: QPT of noisy two-qubit memory. — For the sys- 
tems from the example in FigUl the inputs and measurements 
are selected from the set of two-qubit states: \a), | + ) = 
{\a) + \h))/V2, I-) = (|a)-z|&))/V2witha,6= 1,...,16. 
Specifically, the available set of states are the 16 columns of 
the matrices. 
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Considering only coincident input/measurement counts 
the relevant probability outcomes (|6]l are, 

Pab{x) = albXgab, X e ci^xie 

{gab)a = (l>\Ya(l>b, « = 1, . . . , 16 



(14) 



with 4>a, (f^b {o-,b) £ {1, . . . , 16} the selected columns of ( fT3] l. 

Fig|2] shows the error in estimating the process matrix 
AX = Xtruo — -'^cst as measured by the RMS matrix norm 



^ ^ ^_ 

5 50 500 

experiments per configuration (xlO^) 



(a)pbf = 0.05 



t 10" 



5 50 500 

experiments per configuration (xlO^) 

(b) pbf = 0.20 

FIG. 2: RMS estimation error ||Xtruc — ^cst||n-„s vs. number of 
experiments per configuration: selected columns of l ll3b . Error bars 
show the deviation from 50 runs at each setting. 
^2 -minimization (□): X^st = is from Q using all 16 in- 
put/output combinations. This gives a matrix Q £ (^256x256 
fined in (|9} which is full rank, i.e., rank(t/) = 256. 
^1 -minimization (0): Xcst ~ Xe-^ is from (Illb-ill2b using 6 in- 
puts and 6 measurements obtained from the columns of the second 

^ which is full rank, i.e.. 



matrix in J13l >. 
rank(C/) = 36. 



This gives t/ G 



||AX||^,^^^ = (l/«)(Tr AXtAX)i/2 vs. the number of ex- 
periments per input selected from the set ( fTSI ) [19]. The results 
shown are from simulations described in the caption. 

The benefit of -minimization compared to the standard 
£2 -minimization is seen most clearly with small amounts of 
data from highly incomplete measurements. For example, for 
Pbf = 0.05 [Fig|2a)], at 50 x 10'' experiments per input for 
the 6-input/6-output configuration ((? e c36x256) fj^g £^ 
estimation error is 0.0019. Compare this to the £2 error of 
0.0012 at 500 x 10'^ experiments per input for the 16-input/16- 
output configuration (tj e C^^^^^^^). The latter improvement 
can be attributed mostly to the 10-fold increase in the number 
of experiments per input. The additional resources to achieve 
this are significant, i.e., 16 inputs for £2 vs. 6 for £1, and 
additionally, an increase in the total number of experiments 
from 6 X 50 X 10'^ to 16 x 500 x lO"'. It is certainly not in- 
tuitive that to estimate the 240 parameters of the process ma- 
trix, the clearly incomplete set of measurements using only 36 
outcomes (0 in Fig|2|i could produce results not only similar 
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to, but for each number of experiments per input, even better 
than the full input case with all 256 combinations of inputs 
and measurements (□ in Fig|2]i. As seen the £i error is about 
1/2 the £2 error Also, reweighting reduced the (unweighted) 
ii eiTorby 1/2-1/3. 

Comparing the estimation errors with the error between the 
actual and ideal (solid lines in Fig|2]i suggests that at least 
50 X 10'^ experiments per input are needed to achieve a suf- 
ficient post-QPT error correction towards the ideal unitary. 
Fig|2]also reveals that the estimation errors are very similar 
for both levels of bit-flip error, pbf G {0.05, 0.20}. This is ex- 
plained by the Cramer-Rao bound which defines the asymp- 
totic error of any unbiased estimator, i.e., the RMS decays as 
A / ^/N. Here A is effectively the error between the empirical 
Q and actual ^ probabilities which by definition is of order 
one; this provides a reasonable fit to the data in Fig|2] 

Infinite data. — With infinite data the measurements are 
effectively noise-free, so the empirical probability estimates 
are equivalent to the true probabilities. Infinite data esti- 
mates are obtained by solving (|7| and (fTTli-(fT2]l with the con- 
straint V{X) < a replaced by the linear equality constraint 
Pik{X) — Pife(Xti.uc)- For the numerical examples here, ( fT4l i 
gives the linear equality g\i,{X - Xtruc)gab = 0. 

In the examples, both Xi-^ from (fTTTl-(fT2]i and Xi^ from 
(|2]l were numerically equal to Xtmc- This is to be expected 
for Xi^ because of the complete set of 256 full rank mea- 
surements. Almost sparsity makes perfect estimation possible 
with the highly incomplete set of 36 measurements. 

The infinite data case is useful for evaluating different con- 
figuration strategies in simulation, i.e., consider only those 
that result in a good estimate. 

To stress the efficacy of £1 -minimization as a heuristic for 
sparsity, consider replacing the £1 norm in (fTTT i- lfTSl i with the 



RMS norm |l^|l,.,„s, which is effectively the £2 norm of X. 
Solving the 6-input/6-output case (0 in Fig|2]l for pbf — 0.05 
with infinite data gives an RMS error of 0.11, which is con- 
siderably larger than the error between the actual and ideal 
of 0.03 (solid line in Fig|2a)). The estimate gets even worse 
with finite data. This again emphasizes the advantage of £1 
minimization for sparse signal reconstruction |5, 6]. 

Conclusions. — The use of the ^i-norm minimization meth- 
ods of Compressive Sensing iJ, 0] appear to apply equally 
well to sparse QPT. The examples of sparse process matrices 
presented here are meant to represent typical initial imperfect, 
designs. The numerical results illustrate how estimation re- 
source tradeoffs can be obtained. Additionally, the findings 
suggest that QPT resources need not scale exponentially with 
qubits. In the ideal case, the theoretical question of showing 
linear scaling with sparsity using £1 minimization for QPT re- 
mains open. 

Because £1 minimization uses considerably fewer resources 
than standard QPT, use in an on-line setting combined with 
optimal quantum error correction tuned to the specific QPT 
errors is compelling, e.g., ifTTl [T2I [Tsll . Another future direc- 
tion is in conjunction with Hamiltonian parameter estimation. 
Here a bank of estimators can be applied to the data where 
each estimator is tuned via the Ideal/SVD-Basis to one of a 
number of finite samples of the unknown parameters. Such an 
approach may prove useful for a small number of parameters. 
In quantum metrology often a single uncertain parameter is to 
be estimated in an unknown noisy environment, e.g., ifTilfTsll . 
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